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(54) Internet information access system 



(57) An audio interface to the Internet operated by a user 
over a telephone system. The interface generates user 
interpretable audio output (speech and other sounds) and 
interprets user generated audio input (speech and/or 
DTMF tones) of control information which is used to 
control the information transfer process. The speech 
recognition system of the interface uses a dynamically 
constructed list of recognisable phrases based on the 
contents of the current page. Additional pages are 
generated internally to give the user additional control 
over the information transfer process. 
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Internet Information Access System 

This invention relates to an audio interface to the Internet. 

The Internet is a global network of computers allowing access to millions of pages of 
information in a variety of formats. Each page of information held on a computer may 
contain links to other pages either on the same computer or on other computers. The 
information on the Internet may be accessed using an Internet browser application. 
The browser application can display the pages of information visually for the user and 
navigate through the Internet from page to page at the users instruction. A page of 
information typically correspond to a Web page, an Internet News article, or an Email 
message. A variety of protocols are used by the World Wide Web browser to access 
the different types of pages. 

To access the Internet a user typically needs his/her own computer with a browser 
application and a physical connection to the Internet. This means access to the 
massive quantity of information on the Internet is currently restricted to a relatively 
small proportion of the population. 
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According to the present invention there is provided an audio interface to the Internet 
which generates user interpretable audio output of information and interprets user 
generated audio input of control information which controls the transfer of pages of 
information over the Internet. 



A specific embodiment of the invention will now be described by way of example 
with reference to the accompanying diagram (Figure 1) in which the high level 
components are clearly illustrated. 
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The user (1) communicates using a telephone (2) by speaking in a normal voice and/or 
using the DTMF tones generated by pressing the keys on the telephone. Information 
retrieved from the Internet is supplied to the user as speech and other audio sounds. 
Different voices or other audio sounds are used to enable the user to distinguish 
between standard text and links to other pages of information on the Internet. 

The computer interface to the telephone network (4) is implemented using suitable 
existing technology. The audio signals that are received by the computer interface to 
the telephone network (4) from the telephone network (3) (i.e. that are generated by 
the user (1) and telephone (2)) are passed on to the speech and tone recognition 
system (5). In the reverse direction the audio signals that are generated by the speech 
and tone generation system (6) are passed on by the computer interface to the 
telephone network (4) to the telephone network (3) so that the user (1) can listen to 
them. 

The speech and tone recognition system (5) is based on existing technology which is 
enhanced to incorporate dynamic phrase list recognition. Dynamic phrase list 
recognition works as follows. For any particular page of Internet information there are 
a finite number of links that the user may want to select. The dynamic phrase list 
contains the links in the current page of information. The dynamic phrase list can be 
used to simplify the implementation of the speech recognition system. When the user 
speaks the speech recognition system can choose the phrase in the dynamic phrase list 
which most closely matches its interpretation of the speech. The dynamic phrase list is 
supplied by the navigation control system (7). 

The speech and tone generation system (6) is based on existing technology. The 
speech and tone generation system (6) is supplied a text representation of the current 
page by the page processing system (8) and converts it into speech and other audio 
sounds. Different voices or other audio sounds are used to distinguish between 
standard text, links to other pages on the Internet, and other attributes of the text such 
as italic and bold. 
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The navigation control system (7) processes input from the speech and tone 
recognition system (5) which it uses to control the movement of the Internet browsing 
system (9) from one page to another. The navigation control system (7) also supplies 
the speech and tone recognition system (5) with the dynamic phrase list using the text 
representation of the current page produced by the page processing system (8) and an 
internal history of pages processed so far. 

The page processing system (8) processes pages supplied by the Internet browsing 
system (9). Each page typically corresponds to a Web page, an Internet News article, 
or Email message. The page processing system applies a series of content sensitive 
filtering, reformatting and conversion operations to convert the page into a text 
representation that is more suitable for sequential reading. Typical operations include 
converting date and time information into a simple and consistent format, and 
identifying and extracting the page specific content from the page's general header 
and footer information which is less interesting to the user. The text representation of 
the page produced by the page processing system (8) is used by the navigation control 
system (7) and the speech and tone generation system (6). 

The Internet browsing system (9) implements a variety of protocols for transferring 
pages over the Internet including World Wide Web pages, Internet News articles and 
Email messages. In addition it may generate pages of information internally to give 
the user additional control of the information transfer process. For example, internal 
pages may be generated to aid the user to construct a reply to an Email message which 
the Internet browsing system (9) will then send over the Internet using an appropriate 
protocol. 

The computer interface to the Internet (10) is implemented using suitable existing 
technology. 
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Claims 



1. An audio interface to the Internet which generates user interpretable audio output 
of information and interprets user generated audio input of control information 
which controls the information transfer process. 

2. An audio interface to the Internet as claimed in Claim 1 wherein the user generated 
audio input of control information may be normal speech and/or DTMF tones. 

3. An audio interface to the Internet as claimed in Claim 1 or Claim 2, wherein the 
generated user interpretable audio output of information may include speech. 

4. An audio interface to the Internet as claimed in Claim 3 wherein different voices or 
other audio sounds axe used to enable the user to distinguish between standard text 
and links to other pages of information on the Internet. 

5. An audio interface to the Internet as claimed in Claim 2 or Claim 4, wherein the 
speech recognition process is enhanced by using a dynamic phrase list which 
consists of the phrases that the user can issue to control information transfer 
process given the current page of information. 

6. An audio interface to the Internet substantially as described herein with reference to 
the accompanying illustration (Figure 1). 

7. An audio interface to the Internet substantially as described herein with reference to 
the accompanying illustration (Figure 1) where the type of information that can be 
transferred is limited to World Wide Web pages. 

8. An audio interface to the Internet substantially as described herein with reference to 
the accompanying illustration (Figure 1) where the type of information that can be 
transferred is limited to Email pages. 

9. An audio interface to the Internet substantially as described herein with reference to 
the accompanying illustration (Figure 1) where the type of information that can be 
transferred is limited to Internet News pages. 
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